LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



Craniata ; Vertebrata ; Eut eleostomi ; 
Catarrhini; Hominidae; Homo. 



BG529820 826 bp mRNA linear EST 03-APR-2001 

602558904F1 NIH_MGC_61 Homo sapiens cDNA clone IMAGE : 4696752 5', 
mRNA sequence. 
BG529820 

BG529820 .1 GI : 13521357 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 826) 
NIH-MGC http : //mgc . nci . nih . gov/ . 

National Institutes of Health, Mammalian Gene Collection (MGC) 

Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 

Ema i 1 : cgapbs - r@ma i 1 . nih . gov 

Tissue Procurement: ATCC 

cDNA Library Preparation: CLONETECH Laboratories, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http: //image. llnl .gov 
Plate: LLCM1524 row: h column: 01 
High quality sequence stop: 820. 

Location/Qualifiers 

1. .826 

/organism="Homo sapiens" 

/mol_type= "mRNA" 

/db_xref ="taxon: 9606" 

/clone= " IMAGE : 4696752 " 

/ 1 issue_type= " embryonal carcinoma " 

/lab_host="DH10B (Tl phage-resistant) " 

/ cl one_l ib= " NI H JV1GC_6 1 11 

/note="Organ: testis; Vector: pDNR-LIB (Clontech) ; Site_l: 
Sfil (ggccgcctcggcc) ; Site_2: Sfil (ggccattatggcc) ; 
Double- stranded cDNA was prepared from cell line RNA. 5' 
and 3* adaptors were used in cloning as follows: 5 ! 
adaptor sequence: 5 ' -CACGGCCATTATGGCC-3 1 and 3' adaptor 
sequence : 5 » -ATTCTAGAGGCCGAGGCGGCCGACATG-dT ( 3 0 ) BN- 3 ■ 
(where B = A, C, or G and N = A, C, G, or T) . Average 
insert size 1.75 kb (range 0.9-4.0 kb) . 15/15 colonies 
contained inserts by PCR. This library was enriched for 
full-length clones and was constructed by Clontech 
Laboratories (Palo Alto, CA) . Note: this is a NIH_MGC 
Library. " 



ORIGIN 



Query Match 44.8%; 
Best Local Similarity 97.6%; 
Matches 798; Conservative 



Score 733.6; DB 12; 
Pred. No. 3.5e-161; 
0; Mismatches 14; 



Length 826; 



Indels 



6 ; Gaps 



5; 



Qy 
Db 



1 GAGGAGCXjGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGC^ 60 

I II I II 1 1 1 1 1 1 1! ! 1 1 M 1 1 1 II 1 1 1 1 1 1 II I II 1 1 1 II 1 1 1 II I II II I II 1 1 1 II 1 1 

11 GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCT 70 



Qy 



61 GAC^CCTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGC 120 



IIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIIIMIIIIIIIII 

71 GACACCTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCT 



Db 


71 


GAOVCCTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGC 


130 


Qy 


121 


<^CCTTGATCCAAGC(^CCCTC^^ 


180 




1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 M M 1 1 1 1 M 1 1 1 1 If 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 




Db 


131 


CACCTTGATCC^GCCACCCTCAGTCCC^CTGCAGTTCTCATCCTCGGCCCAAAAGTCAT 


190 


Qy 


181 


CAAAGAAAAGCTGAC^ 


240 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I M 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1 1 1 1 i 1 1 




Db 


191 


CAAAGAAAAGCTGACACAGGAG 


250 


Qy 


241 


GCCGCTGCTCAGTGCCATGCGGGAAAAGCCAGCCGGA 


300 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 II II II 1 1 1 II It 1 1 II 1 1 1 1 II II 




Db 


251 


GCCGCTGCTCAGTGCCATGCGGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCT 


310 


Qy 


301 


GGTGAACACCGTCCTGAAGCACATCATCTGGCTGAAGGTCATCACAGCTAACATCCTCCA 


360 




1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 II 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


311 


GGTGAAC^CCGTCCTGAAGCACATCATCTGGCTGAAGGTCATCACAGCTAACATCCTCCA 


370 


Qy 


361 


GCTGCAGK3TGAAGCCCTCGGCCAATGACCAGGAGCTGCTAGT 


420 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 II 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


371 


GCTGCAGGTGAAGCCCTCGGCCAATGACCAGGAGCTGCTAGTCAAGATCCCCCTGGACAT 


430 


Qy 


421 


GGTGGCTGGATTCAACACGCCCCTGGTC^GACC^TCGTGGAGTTCCAC^TGACmCTGA 


480 




1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 II II 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


431 


GGTGGCTGGATTCAACACGCCCCTGGTCAAGACCATCGTGGAGTTCCACATGACGACTGA 


490 


Qy 


481 


GGCC(^GCCAC(^TCCGCATGGACACC^GTGCAAGTGGCCCC^CCCGCCTGGTCCTC^G 


540 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 II 11 1 1 II 1 1 II II 1 1 1 1 II 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 11 1 1 1 1 1 1 II 1 II 1 1 1 




Db 


491 


GGCCC^GCCACCATCCGCATGGACACC^GTGCAAGTGGCCCCACCCGCCTGGTCCTCAG 


550 


Qy 


541 


TGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTATAAGCTCTCCTTCCT 


600 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllllllllllllll 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllllllllllllll 




Db 


551 


TGACTGTGCCACCAGCCATGGGAGCCTGCGCATC 


610 


Qy 


601 


GGTGAACGCCTT-AGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGCCCAATCTAG 


659 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 II 1 1 




Db 


611 


GGTGAACGCCTTAAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGCCCAATCTAG 


670 


Qy 


660 


TGAAAAACCAGCTGTGTCCC - -GTGATCGAGGCTTCCTTCAATGGCATGTATGCAGACCT 


717 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


671 


TGAAAAACCAGCTGTGTCCCCGTGAATCGAGGCTTCCTTCAATGGCATGTATGCAGACCT 


730 


Qy 


718 


-CCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTC 


776 




IIIIIIII MM 1 Ml; 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IIIIIIII 1 1 1 1 1 1 1 




Db 


731 


CCCTGCAGCTGGTGAAGGTGCCATTTTCCCTCAGCATTGACCG - CTGGAGTTGGACCTTC 


789 


Qy 


777 


TGTATCCTGCCATCAAGGGTGACACCATTCAGCTCTAC 814 






1 IIIIIIII IIIIIIII IIIIIIII Mill 




Db 


790 


TTGTATCTGCCATC - AGGGTGACCCCATTCAGTTCTAC 826 





